Manticore: A 4096-Core RISC-V Chiplet Architecture for Ultraefficient Floating-Point Computing

نویسندگان

چکیده

Data-parallel problems demand ever growing floating-point (FP) operations per second under tight area- and energy-efficiency constraints. In this work, we present Manticore, a general-purpose, ultraefficient chiplet-based architecture for data-parallel FP workloads. We have manufactured prototype of the chiplet’s computational core in Globalfoundries 22FDX process demonstrate more than 5x improvement energy efficiency on intensive workloads compared to CPUs GPUs. The compute capability at high area is provided “Snitch: A tiny pseudo dual-issue processor efficient execution workloads,” IEEE Trans. Comput., containing eight small integer cores, each controlling large unit (FPU). supports two custom ISA extensions: SSRs extension elides explicit load store instructions by encoding them as register reads writes (“Stream semantic registers: lightweight RISC-V achieving full utilization single-issue cores,” Comput.). repetition decouples from FPU allowing be issued independently. These extensions allow minimize its instruction fetch bandwidth saturate FPU, above 90%, with 40% dedicated FPU.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A 32-bit FPGA-based Single Precision Floating-point Hybrid CORDIC Processor Based on RISC Architecture

This paper presents the design process of a 32-bit single precision floating-point Hybrid Coordinate Rotation Digital Computer (CORDIC) processor on Field Programmable Gate Array (FPGA) which used to perform the mathematical computation operations for various elementary functions such as trigonometry and hyperbolic functions, exponential, natural logarithm, square root as well as multiplication...

متن کامل

Hardware accelerated approach for floating-point multiplication on 32-bit pipelined RISC-V processor

Implementing hardware support for all extensions of the RISC-V Instruction Set Architecture inside a processor would lead to avoidable area and power consumption for applications that rarely utilize a particular extension. In this paper, authors have first suggested a modified 3-stage pipeline alternative to the ZSCALE processor (32-bit) by UC Berkeley. Subsequently a hardware-accelerated appro...

متن کامل

A RISC-V Extension for the Fresh Breeze Architecture

We report on a RISC-V extension for a novel multi-core computer organization able to execute applications with high performance and energy efficiency. Novel features of this architecture include support for data objects represented by trees of 128-byte memory chunks, and hardware implementation of task scheduling and load balancing. We call our project Fresh Breeze1 in view of its novelty and p...

متن کامل

A Transprecision Floating-Point Platform for Ultra-Low Power Computing

In modern low-power embedded platforms, the execution of floating-point (FP) operations emerges as a major contributor to the energy consumption of compute-intensive applications with large dynamic range. Experimental evidence shows that 50% of the energy consumed by a core and its data memory is related to FP computations. The adoption of FP formats requiring a lower number of bits is an inter...

متن کامل

Labeled RISC-V: A New Perspective on Software-Defined Architecture

Traditional computer architectures are insufficient to convey important high-level requirements of applications to the hardware. These requirements include QoS and security, which are extremely important to data centers in the cloud era. To guarantee better QoS in data centers, we propose a new computer architecture LvNA (Labeled von Neumann Architecture) that leverages labeling mechanism and p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Micro

سال: 2021

ISSN: ['1937-4143', '0272-1732']

DOI: https://doi.org/10.1109/mm.2020.3045564